Generating frequent itemsets incrementally: two novel approaches based on Galois lattice theory
نویسندگان
چکیده
Galois (concept) lattice theory has been successfully applied to the resolution of the association rule problem in data mining. In particular, structural results about lattices have been used in the design of efficient procedures for mining the frequent patterns (itemsets) in a transaction database. As transaction databases are often dynamic, we propose a detailed study of the incremental aspects in lattice construction to support effective procedures for incremental mining of frequent closed itemsets (FCIs). Based on a set of descriptive results about lattice substructures involved in incremental updates, the paper presents a novel algorithm for lattice construction that only explores limited parts of a lattice for updating. Two new methods for incremental FCI mining are studied: the first one inherits its extensive search strategy from a classical lattice method, whereas the second one applies the new lattice construction strategy to the itemset mining context. Unlike batch techniques based on FCIs, both methods avoid rebuilding from scratch the FCI family whenever new transactions are added to the database and/or when the minimal support is changed.
منابع مشابه
Fast Algorithms for Mining Generalized Frequent Patterns of Generalized Association Rules
Mining generalized frequent patterns of generalized association rules is an important process in knowledge discovery system. In this paper, we propose a new approach for efficiently mining all frequent patterns using a novel set enumeration algorithm with two types of constraints on two generalized itemset relationships, called subset-superset and ancestor-descendant constraints. We also show a...
متن کاملClosed Set Based Discovery of Small Covers for Association Rules
In this paper, we address the problem of the usefulness of the set of discovered association rules. This problem is important since real-life databases yield most of the time several thousands of rules with high confidence. We propose new algorithms based on Galois closed sets to reduce the extraction to small covers (or bases) for exact and approximate rules, adapted from lattice theory and da...
متن کاملA Frequent Closed Itemsets Lattice-based Approach for Mining Minimal Non-Redundant Association Rules
There are many algorithms developed for improvement the time of mining frequent itemsets (FI) or frequent closed itemsets (FCI). However, the algorithms which deal with the time of generating association rules were not put in deep research. In reality, in case of a database containing many FI/FCI (from ten thousands up to millions), the time of generating association rules is much larger than t...
متن کاملInterestingness measures for association rules: Combination between lattice and hash tables
There are many methods which have been developed for improving the time of mining frequent itemsets. However, the time for generating association rules were not put in deep research. In reality, if a database contains many frequent itemsets (from thousands up to millions), the time for generating association rules is more longer than the time for mining frequent itemsets. In this paper, we pres...
متن کاملA framework for incremental generation of closed itemsets
Association rule mining from a transaction database (TDB) requires the detection of frequently occurring patterns, called frequent itemsets (FIs), whereby the number ofFIsmay be potentially huge. Recent approaches forFImining use the closed itemset paradigm to limit themining effort to a subset of the entireFI family, the frequent closed itemsets (FCIs).We show here howFCIs can bemined incremen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. Exp. Theor. Artif. Intell.
دوره 14 شماره
صفحات -
تاریخ انتشار 2002